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Abstract 

Despite substantial rate gains achieved by coordinated transmission from a massive amount of 
geographically distributed antennas, the resulting computational cost and channel measurement overhead 
could be unaffordable for a large-scale distributed antenna system (DAS). A scalable signal processing 
framework is therefore highly desirable, which, as recently demonstrated in |22) , could be established 
based on the concept of virtual cell. 

In a virtual-cell based DAS, each user chooses a few closest base-station (BS) antennas to form its 
virtual cell, that is, its own serving BS antenna set. In this paper, we focus on a downlink DAS with 
a large number of users and BS antennas uniformly distributed in a certain area, and aim to study the 
effect of the virtual cell size on the average user rate. Specifically, by assuming that maximum ratio 
transmission (MRT) is adopted in each user’s virtual cell, the achievable ergodic rate of each user is 
derived as an explicit function of the large-scale fading coefficients from all the users to their virtual 
cells, and an upper-bound of the average user rate is established, based on which a rule of thumb is 
developed for determining the optimal virtual cell size to maximize the average user rate. The analysis 
is further extended to consider multiple users grouped together and jointly served by their virtual cells 
using zero-forcing beamforming (ZFBF). In contrast to the no-grouping case where a small virtual cell 
size is preferred, it is shown that by grouping users with overlapped virtual cells, the average user rate 
can be significantly improved by increasing the virtual cell size, though at the cost of a higher signal 
processing complexity. 
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Index Terms 
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(MRT), zero-forcing beamforming (ZFBF). 


I. Introduction 


The fast growing demand for data rate has posed unprecedented challenges for designers of 
next-generation (5G) mobile communication systems. To support a large number of mobile users 
with high-data-rate applications such as online gaming and streaming high-definition video, many 
new innovative solutions have been proposed (see [1] for a comprehensive overview). In general, 
there are two main ways to improve the data rate: 

1) Utilizing more bandwidth: By moving to the millimeter wave (mmWave) spectrum which 
was widely regarded as unsuitable for mobile communications until recently, large amounts 
of new bandwidth are now becoming available. 

2) Employing more base-station (BS) antennas: The spectral efficiency can be greatly im¬ 
proved by increasing the spatial dimensions according to the Multiple-Input-Multiple- 
Output (MIMO) theory [J2|, [|3j. Significant gains have been demonstrated by adopting 
a massive number of either co-located BS antennas 0-0 (popularly known as massive 
MIMO ) or distributed BS antennas [|5)-[jT5|. 

Among the new solutions, the distributed antenna system (DAS) is gaining increasing momen¬ 
tum, and emerging as a highly promising candidate for the 5G mobile communication systems 
In a DAS, many low-power remote antenna ports are geographically distributed over 
a large area and connected to a central processor by fiber. The appealing features of distributed 
antennas have attracted considerable attention from both industry and academia, and been applied 
to the cutting-edge technologies such as small cellfj^ and the Cloud Radio Access Network (C- 


RAN) [201. 


1 The cellular system with small cells can be regarded as a special case of DAS if replacing each remote antenna port by a 


mini BS and requiring that each mobile user only communicates to the closest mini BS. 
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A. BS Antennas: Distributed or Co-located 


Compared to antenna arrays, the use of distributed antennas provides a much more efficient 
utilization of spatial resources. Specifically, signals to/from distributed BS antennas are subject 
to independent and different levels of large-scale fading. In contrast to the co-located case where 
the channel from the BS antenna array to each user becomes increasingly deterministic as the 
number of BS antennas grows, the channel randomness caused by the small-scale fading can be 
always preserved even with a large amount of distributed antennas. As a result, if the channel 
state information (CSI) is available at the transmitter side, the enhanced channel fluctuations can 


be fully utilized to provide higher water-filling gains and multiuser diversity gains [21], 

More importantly, with distributed BS antennas, the minimum access distance of each user 
can be greatly reduced by increasing the density of BS antennas. As a result, different rate 
scaling laws have been observed in cellular systems with co-located BS antennas or uniformly 


distributed BS antennas [22j—[24). For instance, it was shown in [24] that by assuming that 
the number of BS antennas and the number of users grow infinitely with a fixed ratio v, the 
asymptotic downlink average user rate with maximum ratio transmission (MRT) scales in the 
orders of log 2 v and | log 2 v for the co-located and distributed BS antenna layouts, respectively, 
where a > 2 is the path-loss factor. The rate gains achieved by distributed BS antennas become 
even more prominent when an orthogonal precoding scheme such as zero-forcing beamforming 
(ZFBF) is adopted. 

Note that the above results are obtained by assuming a coordinated transmission of all the BS 
antennas. In a large-scale DAS with hundreds (or even thousands) of geographically distributed 
BS antennas, the computational cost and CSI measurement overhead of such a joint processing 
would be prohibitively high. How to establish a scalable signal processing framework for DASs 
is a key challenge that needs to be addressed. 


B. Virtual-Cell based DAS 

To reduce the system complexity, the common practice of cellular networks is to divide a 
large area into a number of cells where a BS is placed at the center of each cell and serves 
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users who fall into its coverage. As users are randomly located in each cell which have distinct 
access distances to their serving BS, there are always users at cell boundary areas who suffer 
from lower power efficiency and higher inter-cell interference. The performance disparity of 
users could even be exacerbated if distributed BS antennas are used in cellular networks, as 
the BS antennas at cell boundary areas suffer from significantly higher inter-cell interference 
than those at the cell center in the uplink p2|, and become strong interfering sources to the 


neighboring cell-edge users in the downlink [231. As pointed out in [22], the cell-edge problem 
has its roots in the cellular structure where cells are formed based on the coverage of each BS. 
Such a BS-centric structure, nevertheless, cannot be justified when both users and BS antennas 
are scattered around. Instead, the signal processing may be performed based on the unit of 


“virtual cells” [25[ |27|. 

In a virtual-cell based DAS, each user chooses its own serving BS antenna set as its virtual 
cell. The reason why virtual cells are formed in a user-centric manner is two-fold: 1) With a high 
density of distributed BS antennas, each user may usually find more than one BS antennas in 
its vicinity to communicate with; 2) The BS antenna set serving each user varies with the user’s 
location. By choosing a few surrounding BS antennas as each user’s virtual cell, the number of 
channels that need to be measured becomes small, leading to a much lower CSI measurement 
overhead compared to cellular systems with large antenna arrays. More importantly, by increasing 
the density of distributed BS antennas, both the average number of users who share their virtual 


cells and the average number of users served by each BS antenna decline |22|, indicating that 
the signal processing in a large-scale DAS can be performed in a local and scalable manner. 

Note that a few concepts were proposed before which share similarities to virtual cell, but with 
remarkable differences. In cellular networks with coordinated multipoint (CoMP) transmission, 
for instance, clusters of geographically distributed BSs are also formed to serve the cell-edge 
users. In most studies, however, the clustering is performed in a BS-centric manner with a fixed 
and a-priori BS coordination pattern [(28]l-[[32|, which still leads to “cluster-edge” users [29], 


. Dynamic BS clustering was recently proposed in [331, [34| as a way of forming coordinated 
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BS clusters based on users’ locations. Yet different from the idea of virtual cell, it still follows 
the conventional cellular structure where each user is associated and served solely by its closest 
BS, though BSs coordinate their transmissions by choosing beamforming vectors to null out the 
intra-cluster interference. 

Such a BS-centric clustering is also highly inefficient for a large-scale DAS with more BS 
antennas than users where some BS antennas may become redundant as no users are nearby. 
In this case, the BS antenna selection should be performed from users’ perspective. Different 
optimization frameworks were recently established for a if-user DAS with L K distributed 

by 


BS antennas to maximize the weighted sum rate [351 or average energy efficiency 
optimizing BS antenna selection, transmission power and precoding vectors. Instead of choosing 
BS antennas for each user to form a virtual cell, however, a joint antenna selection is performed 
for all the users. 


C. Effect of Virtual Cell Size 

For virtual-cell based DASs, the virtual cell size, i.e., how many BS antennas should be in¬ 
cluded into each user’s virtual cell, is a key system parameter. In this paper, instead of formulating 
a joint optimization of virtual cell selection and precoding vectors which, similar to the problems 
considered in [35], P6), would be intractable due to its non-convex and combinatorial nature, we 


consider a relatively simpler question: for given precoding scheme, what is the optimal virtual 
cell size? 

Specifically, we focus on a large-scale downlink DAS where K 1 users and L > 1 BS 
antennas are uniformly distributed in a certain area, and each user chooses V closest BS antennas 
as its virtual cell. We aim to analyze the effect of the virtual cell size V on the average user rate 
performance under two typical scenarios: 1) each user is served by its virtual cell independently 
of others, and 2) multiple users are grouped together with joint transmission from their virtual 
cells. 


In the no-grouping case, MRT is assumed to be adopted in each user’s virtual cell, and the 
corresponding achievable ergodic rate of each user is derived as an explicit function of the 
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large-scale fading coefficients from all the users to their virtual cells. As the average user rate is 
determined by the joint probability density function (pdf) of the large-scale fading coefficients 
that is intractable when the number of BS antennas L and the number of users K are large, an 
upper-bound of the average user rate is obtained to analyze the effect of the virtual cell size V, 
based on which a rule of thumb for the optimal virtual cell size V* is further developed. It is 
shown that V* is solely determined by the ratio of the total number of BS antennas L to the 
number of users K, which is much smaller than L when K is large. Simulation results verify 
that the rule of thumb provides a good estimation for the optimal virtual cell size to maximize 
the average user rate. 

Due to no cooperation among virtual cells, users suffer from significant interference, which 
greatly degrades the rate performance in the first scenario. Therefore, we further consider 
grouping multiple users and adopt ZFBF to eliminate the intra-group interference. A novel user 
grouping algorithm is proposed, where users whose virtual cells overlap are grouped together. 
In contrast to the conventional BS-centric clustering where users located at the cluster edge still 
suffer from much degraded performance due to strong interference from BSs in neighboring 
clusters, with the proposed virtual-cell based user grouping, all the users achieve relatively 
uniform rate performance with the lowest rate significantly improved. It is further shown that 
different from the no-grouping case where a small virtual cell size is preferred, in this case, the 
virtual cell size V determines a tradeoff between the average user rate and complexity: with a 
larger V, the average user rate can be improved owing to a lower interference level, yet more 
users need to be jointly served, leading to higher signal processing complexity. 

The remainder of this paper is organized as follows. Section II introduces the system model. 
The rate analysis with MRT adopted in each user’s virtual cell is presented in Section III. User 
grouping is further considered in Section IV, and concluding remarks are summarized in Section 
V. 

Throughout this paper, the superscript f and T denote conjugate transpose and transpose, 
respectively. E[-] denotes the expectation operator. [•] denotes the ceiling operator. ||x|| denotes 
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Fig. 1. Illustration of a downlink virtual-cell based DAS. x represents a user and Y represents a distributed BS antenna. Each 
user selects V closest BS antennas to form its virtual cell. 


the Euclidean norm of vector x. I k denotes a K x K identity matrix. 1, x /, denotes a 1 x L 
vector with all entries one. Olxi denotes an L x 1 vector with all entries zero, x ~ CAT(u,a 2 ) 
denotes a complex Gaussian random variable with mean u and variance a 2 . \X\ denotes the 
cardinality of set X. 


II. System Model 

Consider a downlink virtual-cell based DAS with K users and L BS antennas uniformly 
distributed in a circular area with unit radius. Denote the set of users as /C and the set of base- 
station (BS) antennas as B, with |/C| = K and \B\ = L. Assume that each user is equipped with 
a single antenna. As Fig. jT] illustrates, each user selects V closest BS antennas to form its virtual 
cel 10 For user k G /C, denote its virtual cell as V/, with V/,. = V. 

Let us first assume that each user is served by its virtual cell independently of others. This 
assumption will be relaxed in Section IV, where multiple users are grouped together and jointly 

2 Note that in general, each user may choose a different number of BS antennas to form its virtual cell. In this paper, we aim 
to analyze the effect of virtual cell size on the rate performance. Therefore, a homogeneous scenario where all the users have 
the same virtual cell size is considered. 


served by their virtual cells. Let us focus on the downlink performance of user k. The received 
signal of user k can be written as 

Vk — gfe,V fc x fc,V fc + ^2 S k,Vj X j,Vj + n k, (1) 

j&K,j^k 

where x^y, G C v xl is the transmitted signal vector from the virtual cell 14 to user k G 1C. 
n k ~ CAf(0,N 0 ) is the additive white Gaussian noise (AWGN) at user k. gfc i y fc GC lxV denotes 
the channel gain vector from the virtual cell 14 to user k, which can be written as 


g k,v k - lk,v k ° h k,v k , 


( 2 ) 


where h fc,Vfe e C 1 x v ' denotes the small-scale fading vector with entries modeled as independent 
and identically distributed (i.i.d) complex Gaussian random variables with zero mean and unit 
variance. Jk.v k £ R 1 x v " is the large-scale fading vector from the virtual cell 14 to user k. Without 
loss of generality, we ignore the shadowing effect and model the large-scale fading coefficient 
as 


7 k,l — 



a 

2 


1 


(3) 


where a is the path-loss factor, rf is the position of the BS antenna l G B and is the position 
of user k E 1C. o represents the Hadamard product. 

Moreover, we assume that full channel state information (CSI) is perfectly known at both 
the transmitter side and the receiver side. With linear precoding, the transmitted signal from the 
virtual cell 14 to user k can be written as 


Xfc,V fe = w k,v k ■ s k , (4) 

for any k G JC, where s k ~ CJ\f(0. P) is the information-bearing signal and w k ,v k is the precoding 
vector with ||w fci yj| = 1. The total transmission power for each user is assumed to be fixed at 

P. 

The second term on the right-hand side of (jTj), i.e, u k = YhjeKj^k g k,v^j,Vj, is the interference 
received at user k. With a large number of users and BS antennas, u k can be modeled as a complex 
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Gaussian random variable with zero mean and variance J fc , which can be easily obtained from 
0 as 

T \ TTT* [ t ..t 1 7=1 \ \ 2 i=. 

(5) 


h — ^ E 


g*,v, w y Vj wt v .gJ )V .J P = a R ' ' P - 

jC.KJ/k /C.V, 


where 


( 6 ) 


Hi = E h,. v . [HA 2 ] , 

with E/cv, a j,i = 1 for an y J e /C. 

In this paper, we normalize the total system bandwidth into unity and focus on the spectral 
efficiency. The achievable ergodic rate of user k 6 /C can be then written as 

pg,, Vfc w fc , Vfc wi iVfc gi iVfe 


74 = E h 


log 2 1 + 


N 0 + Ik 


(7) 


where the expectation is taken over the small-scale fading matrix H = [h( Vi , lr[ V2 , • • • , h Jcv K \ ■ 
Let 


f^k 


P ||7fc, 


Vfel 


( 8 ) 


No + Ik 

denote the average received signal-to-interference-plus-noise ratio (SINR) of user k. By substi¬ 
tuting ([8]) into 0, the achievable ergodic rate Ft,}- can be further written as 

log 2 (l + /ifcgfc,v*w fci v fc wJ jVfc gJ jV J , (9) 


Rk — Eh 


where 


S k,v k - Pk,V k ° h 


k,V k 


( 10 ) 


denotes the normalized channel gain vector from the virtual cell V/, to user k, and (3k,v k is the 
normalized large-scale fading vector with entries 


(3k,l — 


7 k,l 


II7 k, 


(ID 


■Vfe | 


l G Vfe. Obviously, we have Yhiev k ALi = 1 f° r an y k £ K,. 

It can be clearly seen from 0 that the rate performance is closely dependent on the precoding 
scheme adopted by each user’s virtual cell. In the next section, we will take the example of MRT 
to demonstrate the effect of virtual cell size V on the rate performance. 











10 


III. Rate Performance without User Grouping 
In the single-user case, it is well known that MRT is optimal for achieving the system capacity 


137). In this section, we assume that each user is served by its virtual cell using MRT. For any 
user k e /C, the precoding vector is given by 


sU 


( 12 ) 


w fc,v & — ,| 

In the following, we will first derive the achievable ergodic rate of each user as an explicit 
function of the large-scale fading coefficients. 


A. Achievable Ergodic Rate /?/.. with MRT 

By substituting (fU|) into ([9]), the achievable ergodic rate of user k with MRT can be obtained 


as 


Rk = E hfejVfc [log 2 (1 + hk\\gk,v k \\ 2 )] , (13) 

where the average received SINR ///,. can be obtained by combining (|5]), ([8]) and ([12]) as 

li k = - - Wj^L --- ( i4) 


p + Yhjetc,j^k ^icVj ^ ( Tj,u {Tj,i}i i ti,i£V j ) ' 


with T (x; bi, b 2 , ■ ■ ■ , by- 1 ) defined as 


1 if V = 1, 

T (x, b\,b 2 , , by—l) — y_i x~ 2 bt^{\ogx~ 2 -\ogb^?-l)+bt^ y,y-i b ^ 2 

2^m=l (z-2-b- 2 ) 2 l U=l,t^m ft" 2 _fe- 

(15) 


otherwise. 


Appendix [A] presents the detailed derivation of ( [T4| ). 

As the normalized channel gain ||gfc,v fc || 2 is a hypoexponential random variable with the 
corresponding probability density function (pdf) given by 


P. 


-2 

k,q 


f\\gk,v k \\ 2 ( X ) = J2 Pkj eX P {“AcM II o-2_o-2» 

ZSVfc q&V k ,q& Pk ’<l 1 k ’ 1 


(16) 
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(a) (b) 

Fig. 2. Achievable ergodic rate R k of a randomly selected user k versus virtual cell size V under 3 realizations of the large-scale 
fading coefficients { 7 a = 4. P/No = lOdB. K = 50. (a) L = 100. (b) L = 1000. 


by combining ( [IT] ) and ( |T3jfT6| ), the achievable ergodic rate of user k can be finally obtained as 

P ~F ^ (7.7>i) ^T?",7 k,r 


*=E exp 

zeVfe 


7 l,i 


E l 


P + Thm&Vj ^ ( t i,m'i { 7 ' 7 k,-, 

7 l,l 


q&V k ,q^l ^ 


J k ’ q -2 1o §2 e > 


(17) 


where E 1 {x} = / x °° 

It is clear from ( |T7| ) that the achievable ergodic rate of user k is a function of the large-scale 
fading coefficients of all the users, i.e, {7j,z}jGA:,ZGB- Fig. [5] presents the simulation results of the 
achievable ergodic rate Rk of a randomly selected user k given 3 random realizations of the 
large-scale fading coefficient^] {ajj Ijg/c./gb with the total number of BS antennas L = 100 and 
1000, and the total number of users K = 50. It can be clearly observed from Fig. [2] that ( fT7] ) 
is accurate and the rate performance of user k is sensitive to the large-scale fading coefficients 


Moreover, by comparing Fig. 2a and Fig. 2b we can see that on average, the 


3 More specifically, for each realization, we generate the positions of K users and L BS antennas which are supposed to be 
uniformly distributed in a circular area with unit radius, and then calculate the large-scale fading coefficients based on j3j. 
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achievable ergodic rate can be significantly improved by increasing the total number of BS 
antennas L. Yet the effect of the virtual cell size V on the achievable ergodic rate depends on 
the specific realization of the large-scale fading coefficients {Yy,«}ieAC,zei 3 - 

For networks with a large number of users, the average rate performance per user is an 
important performance metric. In our case, as the achievable ergodic rate of each user is closely 
dependent on the large-scale fading coefficients { 7 in the following, we would focus 
on the average achievable ergodic rate (which is referred to as “average user rate”) defined by 

R - [ R k] ■ ( 18 ) 

We are specifically interested in the effect of the virtual cell size V on the average user rate R. 


B. Effect of Virtual Cell Size V on Average User Rate R 

The achievable ergodic rate Rk of user k is derived in ( fTTj ) as an explicit function of the large- 
scale fading coefficients of all the users {'Yj,i}jeic,ieB- With BS antennas uniformly distributed 
within a circular area, the pdf of the large-scale fading coefficient from a given user to a BS 


antenna has been derived in [21]. Nevertheless, it is difficult to obtain the joint pdf of all the 
large-scale fading coefficients {Aj,i}j&K,ieB especially when the number of BS antennas L and 
the number of users K are large. Therefore, we resort to an upper-bound to study the effect of 
the virtual cell size V on the average user rate R in the following. 

1) An Upper-bound of Average User Rate R ub : According to Jensen’s inequality, the achiev¬ 
able ergodic rate of user k given in ( fl3| ) is upper-bounded by 

Rk < log 2 (1 + Rk) ■ (19) 


Then the average user rate R defined in (18) is upper-bound by 


R < [log 2 (1 + R k )]. (20) 

It is clear from ( fl4j ) that the average received SINR Rk is determined by two parts: (a) the 
normalized average received signal power 

S k = \hk,v k \\ 2 , (21) 
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and (b) the normalized interference power 


h- £ £ T (7 3 -, 

jcKJ^k l&V j 

which is lower-bounded by 


eVj ) ' 7k,ii 


( 22 ) 


h > Ik — ^ ' 7k,i, 


(23) 


lev,* 


where j* denotes the closest interfering user of user k. An upper-bound of the average user rate 
can be then obtained by combining ( fl4| ) and ( |2()|]23j ) as 

R < R ub = Er 






!°g2 l 1 +Jfb 


(24) 


Appendix [b] further shows that the upper-bound R ub can be obtained as 

2 y+1 ,„ ., f L\ \ r /. ox k-2 / r r r’~* r‘ 2 \l-v 


R 


ub 


7r 


v 


<K- 1 ) 


2 r 1 ,, „ r 1 rw rw-i 

{L -V,J Jo 



CV2 


1 0 


(i-yR 


n' 

2—1 


1 pxy rx v ~l 



rx 2 


0 JO JO 


(l - Xy) L V \\xi 


V-fold 

1 /*27r /*27 t /*27t 


i= 1 


'0 JO 


'0 


V-fold 


V-fold 


V _ —c 


log 2 1 + 


zu* 


EL t (Vi 2 ;i/i %••• ^i-vVi+iv ,y v 2 ) ■ (yf + z 2 + 2 Vi zcosuji) 


Ct Oi 


a. a 


du\du 2 ■ ■ ■ du>vdxidx 2 ■ ■ ■ dxydy\dy 2 ■ ■ ■ dy v dz, 


(25) 


where T(x; &i, fe 2 ,• • • , by- 1 ) is given in (15) 


Fig. [^presents the upper-bound of the average user rate R" h given in (25) under various value^] 
of the virtual cell size V with the number of users K = 50 and the number of BS antennas 
L = 100 and 2500. For the sake of illustration, the average entropy of the normalized received 
signal power E{ 7 . t y K l fl . log 2 S), and the average entropy of the normalized interference power 


lower-bound E{ 73 . ( } . eJC ieB log 2 l£, which are given in (47) and (48) in Appendix [bJ respectively, 


4 Note that R ub is derived in |25) as a 31Z-fold integral. Although the numerical calculation of multi-dimensional integration 
can be effectively performed by using numerical softwares such as Matlab, the computational complexity sharply increases with 
V. Therefore, we only present the results of R ub for V < 10. 
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(a) (b) 

Fig. 3. Upper-bound of the average user rate R ub versus virtual cell size V. (a) L = 100. (b) L = 2500. a = 4. P/No = lOdB. 


K = 50. 


are also plotted in Fig. 3 It can be clearly seen from Fig. 


3a 


that both E 




log 2 S k 


and KleB \og 2 If increase with the virtual cell size V and converge when V is large. 

Intuitively, with a large virtual cell size V, more BS antennas are included for each user’s signal 
transmission, and thus both the received signal power and the interference power are increased 
on average. When V is large enough, however, the newly added BS antennas in each user’s 
virtual cell are far away from the user with negligible contributions under MRT. As a result, 
the effect of V on the rate performance becomes marginal. As we can see from Fig. [3a| the 
upper-bound of the average user rate significantly varies with the virtual cell size V only when 
V is small. 

It can be also observed from Fig. [3]that there exists an optimal virtual cell size V* to maximize 
the upper-bound of the average user rate, and V* depends on the total number of BS antennas 


L. Specifically, with L = 100 in Fig. 3a R ub is maximized when V — 1. When L increases to 


2500, it can be seen from Fig. 3b that the optimal virtual cell size V* becomes larger than 1. 
As the optimization of R ub , which is obtained in (25) as a multi-fold integral, is intractable, we 


will develop a rule of thumb for choosing the optimal virtual cell size V* in the next subsection. 
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Y 


Fig. 4. Illustration of the radiuses of the virtual cells of user k and its closest interfering user j*. x represents a user and Y 
represents a distributed BS antenna. 


2) A Rule of Thumb for Optimal Virtual Cell Size V*: To visualize the effect of the virtual cell 
size V on the rate performance, let us first define the radius of a user’s virtual cell as the distance 
from this user to its V"th closest BS antenna. Fig. [4] illustrates the radiuses of the virtual cells 
of user k and its closest interfering user j*{^] Intuitively, as the virtual cell size V increases, the 
radius of user k’s virtual cell r k becomes larger, indicating that more BS antennas are included 
into user k’s virtual cell and thus the average received signal power of user k increases. On 
the other hand, with a larger V, the radius of user j*’s virtual cell r ? « would also increase, 
with which more BS antennas transmit to user k’s closest interfering user j*, leading to higher 
interference for user k, as Fig. [3] shows. If V is too large, the virtual cells of user k and user j* 
would overlap with each other, in which case user k would suffer from strong interference from 
user j* due to the shared BS antennas. Therefore, to reach a fine balance between the average 
received signal power and interference, we propose to choose the largest virtual cell size without 

5 Similar to the derivation of the upper-bound R u , here we only consider user k’s closest interfering user j* whose signal 
contributes the most to the interference of user k. 
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causing an overlap of user k and user j*’s virtual cells. That is, 


max V 


s.t. ('k T j* ^ dk,j*, 


(26) 

(27) 


where dkj* is the distance between user k and user j*. As we focus on the average user rate, 
we further relax the constraint to 


E 


bj ,d. 




[r k + r r \ < d k j* , 


(28) 


where d Kj * = E {7 . il} . eKil6B [d Kj *]. 


Appendix [C] shows that the solution of the optimization problem defined in (26) and (28) is 
given by 

^ = I ((Fip (“ (/f - + 5 r G’°) - 5 r ~ 2 


for large K L 

“ °' 2 k' 


(29) 


where T(s, x) = 1 e t dt. (29) indicates that the optimal virtual cell size 17* is solely 


determined by the ratio of the number of BS antennas L to the number of users K. Intuitively, 
with an increasing number of BS antennas L, more BS antennas should be included in user k 's 
virtual cell to improve the average received signal power. In contrast, if the number of users K 
increases, the virtual cell size V should be reduced to avoid the overlap of different virtual cells. 
Note that the size of each virtual cell should be an integer and no smaller than 1. Therefore, 


(29) should be further refined as 


V* = 


L 

°' 2 k 


(30) 


3) Simulation Results: Fig. [^presents the simulation results of the average user rate R when 
the ratio of the number of BS antennas L to the number of users K is fixed at 2, 20 and 50. For 
demonstration, the upper-bound of the average user rate R ub given in (25) is also plotted. It can 


be clearly seen that R varies with the virtual cell size V in the same manner as its upper-bound 
R ub . That is, it monotonically decreases as V increases when the ratio L/K = 2. If L/K = 20 
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Fig. 5. Average user rate R versus virtual cell size V. For demonstration, the upper-bound of the average user rate R u given 


1 25 i is also plotted, a = 4. P/N 0 = lOdB. K = 50. L/K = 2, 20, 50. 


or 50, in contrast, it first increases with V and then decreases. It can be also observed from Fig. 
[5] that the rule of thumb developed in ( |3(7| ) provides a good estimation for the optimal virtual 
cell size. According to ( [30] ), the optimal virtual cell size V* needs to be enlarged as the ratio 
of the number of BS antennas L to the number of users K increases. Fig. [5] corroborates that 
as L/K increases from 2 to 50, the optimal virtual cell size to maximize the average user rate 
should be increased from 1 to 10. 

It can be also seen from (30) and Fig. [5] that with a large number of users, the optimal 
virtual cell size is much smaller than the total number of BS antennas L, which indicates that 
compared to a joint transmission from all the BS antennas, choosing a small number of adjacent 
BS antennas not only greatly lowers the complexity, but also leads to superior rate performance. 
Moreover, ( |30| ) shows that selecting the closest BS antenna, i.e., V — 1, is optimal only when 
the ratio L/K is small. For the future 5G mobile communication system which is expected to 
deploy a high density of BS antennas, multiple BS antennas would need to be selected to form 
each user’s virtual cell. 
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Fig. 6. Illustration of BS clustering. The BSs are divided into 4 clusters with 10 BSs in each cluster, x represents a user and 
o represents a BS. The cell edge is plotted in dotted lines and the cluster edge is plotted in solid lines. L = 40. K = 20. 


IV. Rate Performance with User Grouping 

In Section III, we have analyzed the average user rate when no cooperation is adopted among 
different users’ virtual cells. In that case, each user suffers from considerable interference, which 
greatly degrades the rate performance. In this section, we further consider grouping multiple users 
together with coordinated transmission from their virtual cells. 

A. Virtual-Cell based User Grouping 

As we mentioned in Section I, for cellular systems with CoMP transmission, multiple BSs are 
normally clustered so as to improve the rate performance of cell-edge users. As we can see from 
Fig. [6j in this case, users located at the cluster edge still suffer from much degraded performance 
due to strong interference from BSs in neighboring clusters. The edge effect indeed originates 
from the BS-centric structure. As different users could be geographically close to different sets 
of BSs, clustering BSs would always lead to a few “unlucky” ones whose closest BSs do not 
belong to the same cluster and thus cannot coordinate their transmissions. To ensure that every 
user is served by its best BS set, the grouping should be user-centric, and the information of 
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Algorithm 1 Virtual-Cell based User Grouping 
l: Initialization: Each user k E /C selects its V closest BS antennas to form its virtual cell V/,.. 

m = 0 . 

2: while /C 7^ 0 do 

3: V any user k G 1C, set the user group JC m = {/;:} and the BS antenna set B rn = V/.. 

Delete user k from 1C. 

4: while B m fl (Ujyfcje/cVj) 7^ 0 do 

5 : for j E /C do 

6: if V t n B, n 7^ 0 then 

7 : j 6 /C m and £> m = U V ; . Delete user j from 1C. 

8 : end if 

9: end for 

to: end while 

it: m — m + 1. 

12: end while 


users’ virtual cells should be fully utilized for user grouping. 

More specifically, in a virtual-cell based DAS, since each user forms its virtual cell by selecting 
V closest BS antennas, users who are close around tend to choose the same, or partially the same, 
BS antenna set as their virtual cells, and cause strong interference to each other if no coordination 
is performed among their virtual cells. Therefore, we propose the following virtual-cell based 
user grouping algorithm: For any users k,j G /C, if their virtual cells overlap with each other, 
i.e., 14 fl Vj 7^ 0 , then group them together and merge their virtual cells. Repeat this step until the 
whole user set is divided into disjoint subsets. The detailed description is presented in Algorithm 

ED 

Fig. [7] presents the grouping results by applying the proposed virtual-cell based user grouping 
algorithm to the topology given in Fig. [6j It can be clearly seen that in contrast to Fig. [6] where 
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Fig. 7. Illustration of user grouping results with different values of virtual cell size under the same topology as Fig. El (a) 
V = 2. (b) V — 3. x represents a user and o represents a distributed BS antenna. Different user groups are separated by solid 
lines. L = 40. K = 20. 


the cluster-edge users suffer from strong interference from adjacent BSs, every user is always 
served by at least V closest BS antennas. By doing so, all the users achieve relatively uniform 
rate performance with the lowest rate significantly improved compared to the BS clustering case, 
as we will demonstrate in the following subsection. 

B. Achievable Ergodic Rate /?/. with ZFBF 

Let us first extend the system model proposed in Section II to incorporate user grouping. 
Denote K, m as the user group that user k belongs to, and B rn as the corresponding BS antenna 
set serving the users in JC m , i.e., B rn = U jeJc m Vj. The received signal of user k can be then 
written as 

Vk — + ^2 g + ^2 + n k- (31) 

j£lCm,jAk j&Kt,t^m 

In contrast to 0. the second item of ( |3Tj ) denotes the intra-group interference, which can be 
effectively suppressed by coordinating transmissions of the virtual cells of users in the same 


group. In this section, we assume that ZFBF 08] is adopted in each user group to eliminate the 
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Fig. 8. Achievable ergodic rate of each user Rk with virtual-cell based user grouping. The x-axis denotes the index of a user. 
a = 4. P/No = lOdB. K = 20. L = 40. 


intra-group interference. For each user group 1C m , let G/c m 6 ,r|A,„|x|& m denote the channel gain 
matrix from user group JC m to its serving BS antenna set B m . F/c m denotes the pseudo-inverse 
of G Km , i.e, Fic m = Gj Cm (Gyc m Gj Cm )" 1 . The precoding vector w k)Bm for any user k G /C m can 
be then written as 


tfc.JC,, 


w k,B m ~ 


z k,Kn 


0 


|Bm|xl 


if \Bm\ > \Kr 

otherwise, 


(32) 


where h.x rn ' s the column vector of F K : rn corresponding to user k. Note that in (32), w t , em ^ 0 
only if \B m \ > |/C m | because ZFBF requires that the number of transmit antennas is no smaller 
than the number of receive antennas. The achievable ergodic rate of user k 6 JC m is then given 
by 


Rk = Eh.- 


log 2 1 + 




(33) 


N 0 + I k 

where H^; m e cl^m|x|B m | denotes the small-scale fading matrix from user group /C m to its serving 


BS antenna set B m , and the interference power I k is given by I k = X! 




E 




P. 
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Fig. 9. Average user rate R with virtual-cell based user grouping versus virtual cell size V. a = 4. P/No = lOdB. K = 50. 
L/K = 2,20. 


Fig. [8] illustrates the achievable ergodic rate of each user under the topology given in Fig. 
[7] For the sake of comparison, the achievable ergodic rate of each user with the BS clustering 
shown in Fig. [6] is also presented^] It can be clearly seen that the cluster-edge users in the BS 
clustering case achieve significantly lower rates than others. In contrast, with the virtual-cell 
based user grouping, the rate difference is greatly reduced, indicating that the rate performance 
of each user becomes much less sensitive to its position. It can be also observed from Fig. [8] 
that by increasing the virtual cell size V, not only is the lowest rate improved, but also the user 
rate on average. In the next subsection, we will further study the effect of virtual cell size V on 
the average user rate. 

C. Effect of Virtual Cell Size V on Average User Rate R 

Similar to Section III, as the achievable ergodic rate of each user is closely dependent on their 
large-scale fading coefficients, we focus on the average user rate which is defined in ([T8|). Fig. 


6 In this case, ZFBF is adopted in each BS cluster for joint transmission. 
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[9] shows the average user rate with the proposed virtual-cell based user grouping. For the sake 
of comparison, the rate performance without user grouping is also presented. It can be clearly 
seen from Fig. [9] that in contrast to the no-grouping case where the highest average user rate is 
achieved when the virtual cell size V is small, here the average user rate can be significantly 
improved with an increase in V. 

Intuitively, by including more BS antennas into each user’s virtual cell, more users would 
have overlapped virtual cells and be grouped together according to the virtual-cell based user 
grouping algorithm. With more users jointly served by using ZFBF to eliminate the intra-group 
interference, each of them achieves a higher rate thanks to a lower interference level. When the 
virtual cell size V is large enough, nevertheless, all the users are grouped together, and the rate 
thus becomes saturated, as shown in Fig. [9j We can then conclude that here the virtual cell size 
V determines a rate-complexity tradeoff: with a larger V, the average user rate is improved at 
the cost of a higher signal processing complexity as more users need to be jointly served. 

V. Conclusion 

In this paper, the effect of virtual cell size on the average user rate of a large-scale downlink 
DAS is studied under different precoding schemes. The analysis shows that if MRT is adopted 
in each user’s virtual cell, a small virtual cell size should be chosen so as to avoid sharing BS 
antennas for different users which would otherwise cause strong interference. On the other hand, 
if users are grouped with joint ZFBF transmission from their virtual cells to eliminate the intra¬ 
group interference, the average user rate could be significantly improved by increasing the virtual 
cell size. A novel virtual-cell based user grouping algorithm is proposed, with which the rate 
difference among users is greatly reduced compared to the conventional BS-centric clustering. 

Note that despite the simplicity in concept, the group size, i.e., the number of users in each 
group, with the proposed virtual-cell based user grouping algorithm could be highly unbalanced if 
users’ spatial distribution is nonuniform. For practical scenarios where users tend to congregate 
at some hot spots, it is important to further develop user grouping algorithms under certain 
complexity constraints such as the maximum group size. Moreover, in this paper, each user is 
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assumed to select an identical number of closest BS antennas to form its virtual cell, which, 
though easy to implement in practice, may not be the optimal way for rate maximization. A global 
optimal antenna selection for all the users, on the other hand, would lead to prohibitively high 
computational complexity for a large-scale DAS. How to decompose the original combinatorial 
problem based on the concept of virtual cell is an interesting and challenging issue, which 
deserves much attention in the future study. 


Appendix A 
Derivation of ( fT4| ) 

By substituting ([5]) into ([8]), the average received SINR ///,. can be written as 

„ Il'Tfc.Vfc || 2 

/ifc — 


(34) 


p + Ylj£K,j^k Xwey,- a il ' 7 l,i 

If the virtual cell size V — 1, it is clear from (|6]) and ( [I2| ) that a J; i = 1. If V > 1, by following 

a similar derivation to Appendix A in p4|, a 3 j can be obtained as 


— XX 

mGVj ,m^l 


Pj,lPj,m ( lo 'gPj,? ~ lo § Pj,m ~ 1 )+P, 


-4 

j,m 


which can be further written as 

( lo S7yz 2 - - !) + 7 


n 


p. 


-2 


R ~ 2 _ R ~ 2 ’ 
y 3’ m 


(35) 


a j,i — X! 

m&Vj ,m^l 


-4 


( t jf 7 j,m) 'j’t '3’ m 

according to ( [TT] ). < [T4| ) can be then obtained by combining ( [34] ) and ( |36] ). 


n 


1 it 


-2 -2 ’ 
“7 i 


(36) 


Appendix B 


Derivation of (25) 


According to (24), the upper-bound of the average user rate R ub is determined by the nor¬ 
malized average received signal power S k = \\jk.v k II 2 and the lower-bound of the normalized 
interference power I lb = T (tt ,b brAi^Vt*) ' 7 l,i- Let 


d, ,(i) < d (2) < • • • < d (l) (37) 

" J,L k ™ ,L k 

denote the order statistics obtained by arranging the access distances 7/,j, d k}2 , • - • , d k)L of user 
k to L BS antennas, where b denotes the /1h closest BS antenna of user k. Then the normalized 
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BS antenna / ( ! ) 



Fig. 10. Graphic illustration of ( (i>. 

’ j* 


average received signal power S k and the lower-bound of the normalized interference power I k 
can be written as 


w 


S * = XX b 

i=l 


and 


(38) 


i ?=J2 r ( d ^ d 


i= 1 


A* /(!)’ ' ' ’ /(*-!)’ “■* /(* + !)’ 

J J J 4 ,* J 


. d 2 d 2 

’ a •* 7(i-l) ’ „•* i(*4 


, d. 


2 

* »(V) 
_•* 


j-a 

J 


(39) 


respectively, according to (j3j), where T (x; 6i, b 2 , ■ • • , i) is given in (15) and d k ^ is the 
distance from user k to the /th closest BS antenna of user j*. Let d k j* denote the distance from 


user k to user j*. It can be easily obtained from Fig. 10 that 


d k,i (i J 

’ j* 


J d lj* + d \ ,« + 24j*gL ,w cos^. 

V J ' i* ’ i* 


(40) 


Let f d (z) and f Vi (u>i) denote the pdfs of d kij * and ip u respectively, and f d d d 

,J k.l) ’ k.l) ’ k.l) ’ 

k k k 

(xi,x 2 , ■ ■ ■ , xy) denotes the joint pdf of < d, m) > . The upper-bound of the average user 

l J i=i,-,v 
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rate R ub can be then written as 

yub 


RU = E {yi,lh&C,l £ B lo S2 


S k 

i + 4 
il b 


•i 


i rw rV2 


= I fd kJ *(z) 



fd 


0 J 0 


’ 3 * 3 * 3 


,-,d (V) ( 2 / 1 ) 2 / 2 , • • • ,yv) 


V-fold 


1 fXy rx2 



<•2 it /*27 t /*27 t V 


0 JO 


fd (i ),d, ,( 2 ).-,£* ,(V) ( X 1’ X 2, • • • 


'0 ao 


n/«(«*) 


-V- 

V-fold 


-v- 

V-fold 


2=1 


log 2 1 + 


EL*r 


EL t (Vi 2 ; 1 / 1 2 ,• • •, Vi-l^Vi+i,■■■ ,y v 2 ) ■ (z 2 + y 2 + 2 yi z cos u t ) 


Ct Oi 


a a 


du\du 2 ■ ■ ■ dujydxidx 2 ■ ■ ■ dxydy\dy 2 ■ ■ ■ dy v dz, 


( 41 ) 


by combining (24) and (38 401). Similarly, we have 


®{7fe,i}fe6/C,isB 1°§2 R k 


1 j-Xy fXy -1 pX2 ( y 

■ L l0 S2 ( I>E ] fd ki yyd k/ 2y---4 k/ vM^ X ^--- 1 X V) 



0 JO JO 


, i =1 


V-fold 

dxidx 2 ■ ■ ■ dxy , 


(42) 



V-fold 

a \ 

• (y 2 + x 2 + 2xiy cosccj) 2 j rice 1 dec 2 ■ • • dujydxidx 2 ■ ■ ■ dxydy. (43) 


Let us ignore the edge effect and assume that an infinite number of BS antennas and users 
are uniformly distributed in an infinitely large plane with L BS antennas and K users in a disk 
with radius 1. It can be easily obtained that the pdf and the cumulated density function (cdf) of 
the access distance from user k to BS antenna l are given by fd kl {x ) = 2x and F dkl {x) = x 2 . 
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V 


respectively. We then have 

fd a) ,d (C2)1 - ,d fri, ® 2 , • • ■ , Xv) = JT~(7V\ (! - x v) L ~ X n 2xi ' Xl ~ X2 ~ ''' - Xv "’ 

k,i k k,L k y j. 

(44) 

fd^Jx) = (K - 1)(1 - x 2 ) K ~ 2 ■ 2x. (45) 

and 

U,i x i) = 7^- (46) 


(25) can be then obtained by substituting (44 46) into ( |4Tj ). 


Moreover, we have 

®{7fe,i}feeK,isS 



£*** (y>r) (1-4)" 


j~^J Xidxid,X 2 ■ ■ ■ dxy, 


(47) 


1=1 


and E { 7 od fce /c, ie s lo S 2 Ik can be written as 
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by substituting (44 46) into (42) and (43), respectively. 


Appendix C 


Derivation of (29) 


Let us rewrite the optimization problem formulated in (26) and (28) as 

max V 

s.t. f k + fj* < d kJ *, 


(49) 

(50) 
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where r k = E {7j . l}jeK ^ B [r k ] and r r = E { ^ l}jelc ^ B [r r ]. As L BS antennas are uniformly 
distributed, we have 


-2 -2 

r k = r j* = —■ 


V 

T 


(51) 


The constraint (50) can be then written as 




(52) 


It is clear from (49) and (52) that the optimal virtual cell size V* is 

V‘ = j (*,)•) 2 ■ 


(53) 


Note that the pdf of the distance from user k to its closest user j*, d k ,j*, has been given in (45). 
We then have 
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d k j* — 2 
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where T(s, x) = f£°t s x e t dt. Note that for large K, (K — 2)ze ^ ~ 0, T K — 2) ps 0 
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-j=. (54) can be then approximated as 
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Finally, (29) can be obtained by combining 
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